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Abstract 

For a sample of n independent identically distributed p-dimensional centered random 
vectors with covariance matrix let denote the usual sample covariance (centered 
by the mean) and the non-centered sample covariance matrix (i.e. the matrix of 
second moment estimates), where p > n. In this paper, we provide the limiting spectral 
distribution and central limit theorem for linear spectral statistics of the Moore-Penrose 
inverse of Sn and S^- We consider the large dimensional asymptotics when the number 
of variables p —>■ oo and the sample size n —>■ oo such that p/n —c G (l,+oo). We 
present a Marchenko-Pastur law for both types of matrices, which shows that the limiting 
spectral distributions for both sample covariance matrices are the same. On the other 
hand, we demonstrate that the asymptotic distribution of linear spectral statistics of the 
Moore-Penrose inverse of differs in the mean from that of S^- 
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1 Introduction 


Many statistical, financial and genetic problems reqnire estimates of the inverse popnlation 
covariance matrix which are often constrncted by inverting the sample covariance matrix. 
Nowadays, the modern scientihc data sets involve the large nnmber of sample points which 
is often less than the dimension (nnmber of featnres) and so the sample covariance matrix is 
not invertible. For example, stock markets inclnde a large nnmber of companies which is often 
larger than the nnmber of available time points; or the DNA can contain a fairly large nnmber 
of genes in comparison to a small nnmber of patients. In snch sitnations, the Moore-Penrose 
inverse or psensoinverse of the sample covar i ance matrix can be nsed a. s an es t imator for th e 


precision mat r ix [se e, e.g., ISrivastaval (1200711 . IKnbokawa and Srivastaval (1200811 . IHovlel (1201111 . 


Bodnar et al.l (1201511 ]. 


In order to better nnderstand the statistical properties of estimators and tests based on the 
Moore-Penrose inverse in high-dimensional settings, it is of interest to stndy the asymptotic 
spectral properties of the Moore-Penrose inverse, for example convergence of its linear spectral 
statistics (LSS). This information is of great interest for high-dimensional statistics becanse 
more efficient estimators and tests, which do not snffer from the “cnrse of dimensionality” and 
do not rednce the nnmber of dimensions, may be constrncted and applied in practice. Most 
of the classical mnltivariate procednres are based on the central limit theorems assnming that 
the dimension p is hxed and the sample size n increases. However, it has been pointed ont by 
nnmerons anthors that this assnmption does not yield precise distribntional approximations for 
commonly nsed statistics, and that better approximati ons can be obtained consid ering scenarios 
where the dimension tends to inhnity as well [see, e.g., Bai and Silverstehil ( 2004 1 and references 
therein]. More precisely, nnder the high-dimensional asymptotics we nnderstand the case when 
the sample size n and the dimension p tend to inhnity, snch that their ratio p/n converges 
to some positive constant c. Under this condition th e well-known Marchen k o-Pas t nr eqnation 


as we ll as Marchenko-Pastnr law were derived [see, Marcenko and Pastnr ( 1967ll . Silverstein 


(119951 1]. 

While most anthors in random matrix theory investigate spectral properties of the sample 
covariance matrix S„ = ;^ Yll=iyiy'i (here yi,... , Vr, denote s a sample of i.i.d. p-dimensional 
random vectors with mean 0 and variance Sn), Panl ( 2ni4h stndies the differences occnrring 
if S„ is replaced by its centered version S„ = ~ y)(yj “ y)^ (here y denotes the 

mean of yi,..., y„). Cor responding (asympt otic) spectral properties for the inverse of S„ have 
been recently derived by IZheng et ahl (1201311 in the case p < n, which correspond to the case 
c < 1. The aim of the present paper is to close a gap in the literatnre and focnssing on 
the case c G (l,cxo). We investigate the differences in the asymptotic spectral properties of 
Moore-Penrose inverses of centered and non-centered sample covariance matrices. In particnlar 
we provide the limiting spectral distribntion and the central limit theorem (CLT) for linear 
spectral statistics of the Moore-Penrose inverse of the sample covariance matrix. 

In Section |2] we present the Marchenko-Pastnr eqnation together with a Marchenko-Pastnr law 
for the Moore-Penrose inverse of the sample covariance matrix. Section [3] is divided into two 
parts: the hrst one is dedicated to the CLT for the LSS of the psensoinverse of the non-centered 
sample covariance matrix while the second part covers the case when the sample covariance 
matrix is a centered one. While the limiting spectral distribntions for both sample covariance 
matrices are the same, it is shown that the asymptotic distribntion of LLS of the Moore-Penrose 
inverse of and differ. Finally, some technical details are given in Section HI 
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2 Preliminaries and the Marchenko-Pastur equation 

Throughout this paper we use the following notations and assumptions: 


• For a symmetric matrix A we denote by Ai(A) > ... > Ap(A) its ordered eigenvalues 
and by the corresponding empirical distribution function (e.d.f.), that is 


where !{■} is the indicator function. 

• (Al) Let be a p X n matrix which consists of independent and identically distributed 
(i.i.d.) real random variables with zero mean and unit variance. 

• (A2) For the latter matrix X„ = we assume additionally that < oo 

for some 5 > 0. 


• By ^ 

Y„ = s|x„. 

we define a pxn observation matrix with independent columns with mean 0 and covariance 
matrix 0 It is further assumed that neither s| nor X„ are observable. 

• The centered and non-centered sample covariance matrix are denoted by 


s„ = i(Y„ - yl')(Y„ - yl')'= A.Y; - yy' 

n n 

1 1 i i 

_ V V' _ ^2 Y Y' Y2 

n — ^ n ^ n — ’ 

n n 


where 1 denotes the u-dimensional vector of ones and y = ^ '^^=i T*- corresponding 
e.d.f.’s are given by and F®", respectively. 


The Moore-Penrose inverse of a p x p matrix A is denoted by A"*" an d froni its definition 
must satisfy the following four criteria [see, e.g.. lHorn and JohnsohnI fjl985l) ] 


(i) AA+A = A, 

(ii) A+AA+ = A+, 

(hi) AA+ is symmetric, 
(iv) A+A is symmetric. 


It is w orth pointing out that the generalized inverse considered recently by iRodnar et al 
(120151 ) does not satisfy the conditions (iii) and (iv) presented above. If the matrix A 
has a full column rank then the matrix A'A is invertible and the Moore-Penrose inverse 
obeys a simple representation given by 


A+ = (A'A)“‘A'. 


( 2 . 1 ) 


^ We could easily include the population mean vector into the model but it will only make the formulas for 
weak convergence more complex not the analysis itself. 
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• For a function G : R —)■ R of bounded variation we introduce the Stieltjes transform 


mciz) 


+ 0O 


—oo 


Z e C+. 


Remark 1. Although assumption (A2) requires the existence of moments of order 4 + we 
suspect that the results of this paper also hold under the existence of moments o f orde r 4. For 
a proof one would have to use truncation techniques as provided by Bai et ah ( 20071) for the 
matrix l/nY^Y„. These extremely technical details are omitted for the sake of brevity and 
transparency. 


In this paper we are interested in the asymptotic properties of the empirical distribution function 
and linear spectral statistics of the eigenvalues of the Moore-Penrose inverse of the matrices 
S„ and S„. Actually, the limiting spectral distribution of both matrices coincide because they 
differ only by a rank one perturbation. This is shown in the following Lemma 12.11 

Lemma 2.1. Let and be the Moore-Penrose inverses of centered and non-centered sample 
covariance matrices, respectively, then 

2 

||_pS+_ _pS+|| ^ /2.2) 

p 

where 11(71100 is the usual supremum norm of a function : R —)■ R. 


Proof. We obtain for the rank one update of the Moore-Penrose inverse [see iMeverl fjl973l) ] 


S+ = (S„-yy')-^ = S+- 


Snyy'(Sn)^ +(s+)Vy'(s+) , y'(s+)V 


y'(s+)2y 


+ 


rS+yy'S 


/Q + 

n 


(2.3) 


(y'(SJ:)^y)^ 

With the notation u = S^y and v = (S+)^y the difference of the Moore-Penrose inverses can 
therefore be rewritten as follows 
uv' -|- vu' 


= 


u'v 




uu 


u'u vu'v 


u'u 


-vv — 


-vv 


■■u'v 


Lu'u 

/ 


1/2 


u — V 


u'u 


LU'VJ 


1/2 


■"u'v 


LU'UJ 


1/2 


"u'u 


u — V 


LU'V 


l/2x / 


WW , 
f 1~ 


where 


U'U 


w = 


(2.4) 



1/2 


U V 

u u 

u — V 

u'u 

u'v 



1/2 


Thus, t he difference — S / t is a matrix at most of rank 2 and the rank inequality in Theorem 
A.43 of Bai and Silverstein (2010) yields the estimate (12.2p . □ 


We now study the asymptotic properties of the e.d.f. of the spectrum of the Moore-Penrose 
inverse of the sample covariance matrices S„ and S„. As a consequence of Lemma 12.11 and 
equality (12.ip the asymptotic properties of the e.d.f. of both Moore-Penrose inverses can be 
studied concentrating on the matrix 

Si = {l/nY„YX = [( 1 / v 1 ^ Y „)+]'( 1 / v ^ Y „)+ = l / nY „( l / nY ;. Y „)- W ;, 

together with the corresponding e.d.f. . Indeed, because (-s) tends almost surely 

to the solution of the Marchenko-Pastur equation we obtain the following result. 
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Theorem 2.1. Assume that (Al) holds, ^ c G (1, oo) as n ^ oo and that converges 
weakly to a cumulative distribution function (c.d.f.) H. Then the e.d.f. converges weakly 
almost surely to some deterministic c.d.f. P whose Stieltjes transformation mp satisfies the 
following eguation 


+00 


mp{z) = — I 2 — c 
z 


-1 


dH{T) 


ZTc{zmp{z) + 1) — 1 


Proof. Let S„ = be the eigenvalue decomposition of the matrix S„, then 




((U„D„U;.)+ - zl) 


-1 


1 

= -tr 
P 




p — n 
P 


n 1 


1 \ ILL 

— ) H-/ 

r / n -n f ^ 


z) Ai((l/nY;Y„)-i) 


( 2 . 6 ) 


The last two equalities follow from the fact that the spectrum of the matrix differs from that 
of (1/nY^Y^)”^ by exactly p — n zero eigenvalues. For the Stieltjes transform {z) 

in this expression we get 




-lU = 


i_ 

” tr ^r"(i/"YiY») - z 

1 G- a.(i/«y;y„) 

nz ^ \,{l/nY'„Y„) - i 


1 1 




z ^ A.(l/nY;Y„) - i 


11 /I 




( 2 . 6 ) 


Combining the identities fl2.5p and fl2.6p provides an equation which relates the Stieltjes trans¬ 
forms of F®" and , that is 


n 1 


m 




(z) = - 


p z^ 


( 1 /.-). 


(2.7) 


It now follows from iBai and SilversteinI f 2010l j that as p/n —)■ c > 1 the e.d.f.’s and 

p^i/nY„Y; converge weakly almost surely to non-generate distribution functions F and F with 
corresponding Stieltjes transforms satisfying the equation 

1 — c 

mpfz) = -h cmp^z). 


Consequently, we have from fl2.7p almost surely 

^fs+{z) — >mp{z) := ~-c-^^{z{c-l) + cmF{l/z)) 


z z^ 
2 - c-^ 


mF(l/z) 


z 
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as n oo, where mi?(l/z) is the Stieltj es transform of th e limiting distribution F of the e.d.f. 
of S„, which satishes the equation (see, Silverstein f 19951) i 


H-oo 


mF{l/z) = 


dH { T ) 


r(l — c — c 


mFd/f) \ 1 


( 2 . 8 ) 


Thus, z{c ^ — 2 — zmp { z )) = mp { l / z ) must satisfy the same equation (12.81) which implies 

+ 00 

z { c ~^ — 2 — zmp { z )) = 


dH{T) 


r(l — c — c 


z{c ^—2—zmp{z))\ 1 


After some simplihcation the result follows. 


□ 


Corollary 2.1. //S„ = cr^I„ and the assumptions of Theorem \2.1\ are satisfied, then the e.d.f. 
of Sn converges weakly almost surely to a deterministic distribution function P with Stieltjes 
transform 


1/ —\lzF (c— l)cr^ + \/{\jz — ca'^ + — A/za‘^\ 

- ) ■ 

Moreover, the limiting distribution is given by 

F = {1 — c“^)(5o + v{x)dx 
where 5a denotes the Dirac measure at the point a G IR and 

- l/xfil/x-\Z^) 


v(x) = 


2'K(t‘^x 
0’ 

with A+ = cr^(l + \/c)^ and A_ = <7^(1 — y/cfi 


, (re[A+\A_^] 
otherwise, 


3 CLT for linear spectral statistics 

For a (random) symmetric matrix A with spectrum Ai(A),..., Ap(A) we consider the linear 
spectral statistic 

+ 0O p 

F^i9)= f 9{x)dF^{x) = -'^g{\i{A)) 

L 

—oo 

where : K, —)■ K, is a given test function. In the next two sections we will investigate the 
asymptotic properties of F®" and F®". 

3.1 Linear spectral statistics of S^ 

In the following discussion we consider the random function 

G„(x) = p(F=" - pr)(x ), 

where Pf is a hnite sample proxy of the limiting spectral distribution of S^, namely F. Note 
that the function Pf is constructed simply by substituting p/n for c and = F^" for H into 
the limiting distribution F. 
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Theorem 3.1. Assume that 

(i) (Al) and (A2) hold. 

P 

(ii) - )■ c G (1, cxd) as n ^ oo. 

n 

(in) The population covariance matrix is nonrandom symmetric and positive definite with 
a bounded spectral norm and the e.d.f. Hn = converges weakly to a nonrandom 
distribution function H with a support bounded away from zero. 

(iv) gi,... ,gk are functions on R analytic on an open region D of the complex plane, which 
contains the real interval 


[O, limsup Amax(S„^)/(l - \/c)^] 

n—>-oo 


(v) Let Bi denote a p-dimensional vector with the i-th element 1 and others 0, and define 

n,(z) = e',i:l/^mF(l/z)S„ + l)-'-E'Pe„ 

X.iz) = e',Biy(rnni/z)-B„ + l)-^-BlPe, 

as the ith diagonal elements of the matrices 

respectively. The population covariance matrix satisfies 


lim -y^Ki{zi)Ki{z2) = hi{zi,Z2) 

1—Vrv-, n < ^ 


n^oo Ti 


2=1 


lim - y^Ki{z)xi{z) = h2{z) 

n—)-oo n ^ 


2=1 


then 


+ 00 


+ 00 


gi { x ) dGn { x ),..., I gk { x ) dGn { x ) \ ^ ..., , 


where (X^^,... is a Gaussian vector with mean 


+00 




EiXg) = — 


1 r g{z) -L OXmMTJW ^ ’ 




+ 


27ri J z'^ 

E{Xfi) - 3 r g{z) c{mF{l/z)fh 2 {z) 


■dz 


27ri 


— OO — 


dz 


and covariance function 


COV{Xg„Xg,) = 


1 r r g{zi)g{z2) m'p{l/zi)m'p{l/z2) 


ydzldZ2 


(3.1) 


2vr2 / J zfz2 (mpfl/zi) — mf^{l/z2))‘ 

[>T^Fil/zi)mF{l/Z2)hi{zi, Z2)Y dzidz2 ■ (3.2) 
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The contours in LS. il) and liS. S\} are both contained in the analytic region for the functions 
Qi,... ,gk and both enclose the support of Pf for sufficiently large n. Moreover, the contours in 
are disjoint. 


P roof. The proof of t his th eorem follows comb ining argum ents from the proof of Lemma 1.1 
of Bai and Silverstein (2004) and Theorem 1 of Pan f 2014h . To be precise, we note hrst that 
(^) converges almost surely with limit, say mpf^z). We also observe that the CLT of 
p(m^g+(^) — mp{z)) is the same as of z) — m^{l/z)) and proceeds in two 

steps 


(i) Assume hrst that = 3. By Lemma 1.1 of 

the process 

Tl 

-^(mpi/nY;Y„(l/^) -mpil/z)) 

dehned on an arbitrary positively oriented contour C which contains the support of Pf 
for sufficiently larger n converges weakly to a Gaussian process with mean 


Bai and Silverstein (20041 we get that 


1 


J (l+tmjp(l/z))^ 


dH{t) 


z 


2 



t^imp(l/z)p 

{l+tmp_{l/z)) 



2 


and covariance 

1 m'p{l/zi)m'p{l/z2) 1 

^ 1^2 {mFil/z2) - mFil/zi))"^ {zi - Z2y 

In order to obtain the assertion of Theorem 13.11 we use again the argument made at the 
beginning of the proof and the following identity 


CXD 

J g{x)dGnix) 

— CXD 


1 

27ri 




(3.3) 


which is valid with probability one for any analytic function g dehned on an open 
set containing the support of Gn if n is sufficiently large. The complex integral on 
the RHS of fl3.3p is over certain positively oriented contour enclosin g G on which the 
function g is analytic [see the discussion in iBai and Silverstein! (1200411 fol l owing: Lemma 
1.1)]. Further, following the proof of Theorem 1.1 in iBai and Silverstehil ( 2004 1 we ob¬ 
tain that (/ gi(x)dGn(x),..., f gk(x)dGn(x)J converges weakly to a Gaussian vector 
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In the case ^ 3 we will use a result proved in Pan and Zhou ( 20081) . more precisely 

Theorem 1.4 of this reference. Here we hnd out that in the case ^ 3 there appears 

an additional summand in the asymptotic mean and covariance which involve the limiting 
functions hi{zi,Z 2 ) and h 2 {z) from assumption (v), namely for the mean we obtain the 
additional term 

E{Xf^) - 3 c{mF{l/z)fh2{z) 


H-cxd 

1-^/ 
—oo 


{l+tmpil/z)P^^ Wi 


and for the covariance 


E(Xt) - 3 


~2„2 

^1^2 


[mF{l/zi)mFil/z2)hi{zi, Z 2 )]' 
















































These new summands arise by studying the limi ts of products o f E(Xt ) — 3 and the diag¬ 


onal elements of the matrix (S„ — l/^I) ^ [see, Pan and Zhou f 2008[l . proof of Theorem 


1.4], Imposing the assumption (v) we basically assure that these limits are the same. 
The assertion is now obtained by the same arguments as given in part (i) of this proof. 


□ 


A version of Theorem 13.11 has been proved in IZheng et al.l (120131) for the usual inverse of the 
sample covariance matrix S~^. It is also not hard to verify the CLT for linear spectral statistics 
of in the case c < 1 with the same limit distribution. Theorem 13.11 shows that in the 
case c > 1 there appear additional terms in the asymptotic mean and covariance of the linear 
spectral statistics corresponding to S+. 


In general, as we can see, the results on the limiting spectral distribution as well as the CLT 
for linear spectral statistics of follow more or less from the already known hndings which 
correspond to the matrix 1/nY'^Yn- In the next section we will show that the general CLT for 
LSS of the random matrix is different from that of S^. 


3.2 CLT for linear spectral statistics of S^. 

The goal of this section is to show that the CLT for linear spectral statistics of the Moore- 
Penrose inverse differs from that of S^. In order to show why these two CLTs are different 
consider again the identity (12.311 . that is 


S+ = (S„ - yy')+ = 


S+yy'(S+)2 + (S+)V yXS+) ^ y'(S+)V 


y'(s+)^y 


(y'(S+)2y)2 


s+yy's^ 


We emphasize here the difference to the well known Sherman-Morrison identity for the inverse 
of a non-singular matrix S^, that is 


C-I _ I'O _ -I- 

yy) -h„ +J_y,g__,y 

Sliprman anH Morrison (l^Hy)]. Note that it follows from the identity 


[see. 


(3.4) 


1 - y'Ky = 1 - -tl'Y'(l/yS)Y(l/nY'Y)-^(l/v^)Y'Yl = 1 - — = 0, 

n 


that the right-hand side of the formula (13.4p is not dehned for the Moore-Penrose inverse in 
the case p > n. 

Now consider the resolvent of namely 


R(^) = ( ) = ([S„ - yy'^ 


zl 


-1 


, o+ _ S+yyXS+)^ + (S+)VyXS+) y'(S+)V ^ 


= A(z)-—W + 


nstn 
1 


(y'(S+)2y)^ 


-ww 


-1 


u'v u'u 

where we use (12.411 and the notations u = S+y, v = (S+)^y and A{z) = — zl and 


w = 


u'v 


1/2 


u 


u'u 


1/2 
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to obtain the last identity. A twofold application of the Sherman-Morrison formula yields the 
representation 


R(z) = A(z)- — 


vv 


u'v 


(AW-^vv')-‘^{A(.)-^vv') 


-1 


1 + (A(A - ^vv') ^ 

u'u V V / u'v / 


W 


= A 




1 - ^v'A-i 

u'v 


(2)V 


1 + (A(2) - t;I^vv') 


w 


Taking the trace of both sides of fl3.5p we obtain the identity 


P^F^t (^) 


tr(R( 2 :)) =pm^s+(^) 


VA-\z)y 
u'v — v'A“ 1 ( 2 :)v 


w 


(a-‘W + 


A-h2)vv'A-ip) 

u'v-v'A-1(2)v 


2 

W 


u'u + w'A“^(z)w + 


(w'A-l(z)v)2 ’ 

u'v-v'A-1(2)v 


(3.6) 


which indicates that the CLTs for linear spectral statistics of Moore-Penrose sample covariance 
matrices of and might differ. In fact, the following result shows that the last two terms 
on the right-hand side of fl3.6p are asymptotically not negligible. 


Theorem 3.2. Let Gn{x) = p{F^" (g) — and suppose that the assumptions of Theorem 

ro are satisfied, then 


+ 00 


gi{x)dGn{x), 


+ 00 

I gk{x)dG^{x)y 


—oo 


—oo 


where ..., Xgfi)' is a Gaussian vector with mean 


E[Xg) = - 


27r* 


9{z) 


+ 00 
^ / 


(mpil/z)Y 
(l+tm_F(l/z))3 


dH{t) 


+ 00 


-dz — 


(1-cf 




dH{t)y 


EjXfi) - 3 

27ri 


g[z)c{mF{l/z)fh2{z) 


+ 00 


-dz 


1 - e / 


(l+tm£(l/z))2 


1 / g{z) ntf (l/z) 

2^1 J z^ mpfl/z) 

and covariance function 


dz 


(3.7) 


COV[Xg-^ , Xg2 ) - 


1 


27r2 7 7 


7 g{zi)g{z 2 ) m'pjl/zi)m'p{l/Z 2 ) 


ztz. 


dzidz2 


1 / 2 : 2 )) 

“ [mF{l/zi)mF{l/z2)hi{zi,Z2)]" dzidz2 . (3.8) 

The contours in 1^3.1\) and LS.Si) are both contained in the analytic region for the functions 
gi,... ,gk and both enclose the support of Pf for sufficiently large n. Moreover, the contours in 
\3.8\) are disjoint. 


Before we provide the proof of this result we emphasize the existence of the extra summand in 
the asymptotic mean. It has a very simple structure and can be calculated without much effort 
in practice. Indeed, the last integral in fl3.7p can be rewritten using integration by parts in the 
following way (see. Section 4.1 for detailed derivation) 


2m 7 mpfl/z) 


b 

J g'{x) w:g[mF{X/x)]dx , 

a 


(3.9) 


10 


























where m^{l/x) = lim 771^(1/^) for x G K, and the interval (a, 6) contains the snpport of P. 

Z^X 

On the other hand, the asymptotic variances of the linear spectral statistics for centered and 
no n-centered sample covariance matrices coincide. For a discnssion of assnmption (v) we refer 


to 


PanI fl2ni4J) . Remark 2, and other references therein. 


Proof of Theorem 13.21 The proof of Theorem 13.21 is based on the Stieltjes transform method 
and cons i sts of a combination of argnments similar to those given bv lBai and SilversteinI (1200411 
and iPanI (120141 1. First, by the analyticity of the fnnctions gi,... ,gk and (13.3p it is snfficient to 
consider the Stieltjes transforms of the spectral e.d.f. of sample covariance matrices. Fnrther- 
more, recall from (13. 6 p that the Stieltjes transform of the e.d.f. of can be decomposed as 
the snm of the Stieltjes transform of the e.d.f. of Sjjj and the additional term 


w 


^n{z) = 


-1 


/ N A-ip)vv'A-i(z) 
\.Z) -r u/v_v'A-l(z)v 


W 


v'A-2(^)v 
u'v — v'A~^(2;)v u'u + w'A~i(z)w + 


(3.10) 


involving sample mean y and S^. Thus, it is sufficient to show that this random variable 
converges almost surely on C"*" = {z G C : Q'z > 0} to a nonrandom quantity as p/n —>■ c > 1 
and to determine its limit. As a result. The orem 13.21 follows fro r n Slu tsky’ s theorem, the 
contin uous mapping theorem and the results in Bai and Silverstein ( 2004tl and Pan and Zhou 

(I2nn8h . 

It is shown in Section 14.21 that the function in (I3.10p can be represented as 


yJ + 2zeniz)+zXiz) 
1 + zy'y + z^Qn{z) 

where the functions Qn is given by 


in{.z) = - 

2; 


(3.11) 


1 


1 1 


UA = —y'y + —iU(i^y;yo-' - zi)-% 


z zn 

As a consequence, the asymptotic properties of can be obtained analyzing the quantity 


(3.12) 


Vn{Z) 






z\ 


1-1- 


n 


= __tr [(l/nY;.Y„)(l/nY;.Y„ - l/2l)-'l/nl„iy 

‘Z/ 

= -- - 4tr [(l/nY^Y, - l^I)-'©,] , 


where we use the notation ©„ = l/?7,l„l(j. It now follows from Theorem 1 in iR.nbio and Mestre 
(l2nnh that 

|tr [(l/nY(,Y„ - 1 / 2 ;I)~^©„] -x„(l/ 2 ;)| —^0 a.s., 
where x„(l/ 2 :) is a unique solution in C"*" of the equation 


Note that tr (©n) = 1 and that Theorem 1 in iR.nbio and Mestrel (120111) is originally proven 
assuming the existence of moments of order 8 + 5. However, it is shown in iBodnar et al.l (120151) 
that only the existence of moments of order 4 + 5 is required for this statement. 
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In order to see how x„(l/z) relates to m^{l/z) we note that due to assumption (iii) Hn —)■ H 
as n —>■ cxo and, thus, Xn{)-/z) x{l/z). This implies 


1 

Xn{l/z) 


+ llz 


^tr {xn{l/z)l + i:^^) 



TdHnjr) 
Xn{l/z)T + 1 



rdHij) 
x{1/z)t + 1’ 


which leads to 

OO 

— OO 

The last equation is the well-known Marchenko-Pastur equation for the Stieltjes transformation 
m^(l/ z) of the limiting distribution F. Because the solution of this equation is unique we obtain 


x{l/z) = mpi^/z). 

As a result we get the following asymptotics for 1 -|- zf'y + z^6n{z) as n ^ oo 

mF{l/z) 


which ensures that 


in{z) 


1 


I + zy'y + z^9n{z) 


a.s., 


m,p{llz) 


z m^{l/z) z‘^m^{l/z) 


a.s. 


(3.13) 


for p/n —)■ c G (l,-|-cx)) as n —)■ cx). The assertion of Theorem 13.21 now follows taking into 
account the argument made at the beginning of the proof and fl3.13l) . □ 


4 Appendix 


4.1 Derivation of (l3.9ll 


We select the contour to be a rectangle with sides parallel to the axes. It intersects the real 
axis at the points a 7 ^ 0 and b such that the interval (a, b) contains the support of P. The 
horizontal sides are taken as a distance yo > 0 from the real axis. More precisely, the contour 
C is given by 


C = [a + iy : \y\ < yo}u{x + iyo : x G [a,b]}u{b + iy : \y\ < yo}[j{x-iyo : x G [a,b]} (4.1) 

so that (a, 6 ) contains [O, limsup^^j^ Ama 3 ;(S“^)/(l — \/c)^] and is enclosed in the analytic 
region of function g. We calculate the four parts of the contour integral and then let yo tend 
to zero. First, we note that 


d 1 m'p(l/z) 

Then using integration by parts the last integral in fl3.7p becomes 


1 

27ri 


g(z)m'p{l/z) 

niF^l/z) 


dz = 


1 

2'xi 
1 

2711 

= -i^.j>g\z){\og\mF{l/z)\Pia.ig{mF{l/z)))dz, (4.2) 


g{z)d{\ogmF{l/z)) 
g\z) XogrriFii)-/z)dz 
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where any branch of the logarithm may be taken. Natnrally extending the Stieltjes transform 
on the negative imaginary axis and using z = x + iy we get 


\mF0-/z)\ = 


dF{X) 


\ — Ij z 


< 


dF{\) 


dF{X) 


+ y‘^ 


J /(-y _ x \2 I \y\ 

— OO Y ' \z\^ > \z\'^ 


(4.3) 


Next we note that any portion of the integral 04.21) which involve s the vertical side c an be 
neglected. Indeed, using 04.3p . the fact that \g'{z)\ < K and (5.1) in iBai and SilversteinI (120041) 
for the left vertical side we have 


< 


< 


1 

27ri 

K 


yo 


g'{a + iy) logmF(l/(a + iy))dy 


-yo 

yo 


{log \mF{I/{a + iy)) \ + | arg(m^(l/(a + iy))) |) dy 


K 

K 

71 


-yo 

yo 


aA + y"^ , 

log — 1 -|- Vn]dy 


-yo 

yo 


a? + 

log^ALdy + Kyo 

y 


— ( yo log — -Yo + 2a arctan(Yo/a) ) + Kyo , 


TT 


Yo 


(4.4) 


which converges to zero as Yo 0. A similar argument can be used for the right vertical side. 
Consequently, only the remaining terms 


1 

1 


[g\x + iyo)] log \mF{l/{x + iyo))\dx - tt + ^yo)] arg[mF(l/(x + iyo))]dx 


27r 

1 


[g'{x - iyo)] log |mF(l/(x - iyo))\dx - ^ j ^ “ Wo)] arg[mF(l/(x - iyo))]dx 


have to be considered in the limit of the integral 04.2p . Similarly, using the fact that 
sup \'^h{x + iy)\ < K\y\. 

x£[a,b] 


(4.5) 


for any real-valued analy tic function h on the bounded interval [a, b] [see equation (5.6) in 
Bai and Silverstein f 20041 )] . we obtain that the first and the third integrals are bounded in ab¬ 


solute value by 0 (yo fog Yo thus, can be neglected. As a result the dominated convergence 

theorem leads to 


2711 

which proves 03.Op . 


g'{z) {\og\mF{l-/z)\ + iaxg{rnF{l-/z))) dz —V -/ g'{x) axg\mF{l-/x)]dx , (4.6) 

~ ~ yo^o tt / 
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4.2 Proof of the representations (13.111) and (13.121) 

For a proof of fl3.1ip we introduce the notations 


1 , (u'v)l/2 

a = , . , b =-u . 

11^11 


Then 

Cn(2) = 


(u'u)i/ 2 ^ = b — a, and we obtain for fl3.10|) 

a'A-^(z)a 


u'u 


1 — a'A~^(z)a 
(b-a)' (a-^(z 


/I ^ ^(- 2 :)aa'A ^( 2 :) +A ^( 2 :)aa'A ^(z) ^ a'A ^(z)a(A ^(z)aa'A ^(z)) 


1 — a'A^i(z)a 


(1 — a'A~i(z)a)^ 




A tedious but straightforward calculation now gives 


i-a(z) = 


a-A-^(z)a + a-A-^(z)a(b - a)'A-i(z)(b - a) + 

1 — a'A~i(z)a + (1 — a'A~i(z)a)(b — a)'A~i(z)(b — a) + ((b — a)'A~i(z)a)2 


(b-a) 


(1 - a'A-Tz)a)(b - a)'A-2(0)(b - a) 


1 — a'A~i(z)a + (1 — a'A~i(z)a)(b — a)'A~i(z)(b — a) + ((b — a)'A~i(z)a)2 


2(b - a)'A-2(z)a(b - a)'A-i(z)a + 


1 — a'A~i(z)a + (1 — a'A~i(z)a)(b — a)'A^i(z)(b — a) + ((b — a)'A~i(z)a)2 

_a'A-^(z)a[b'A-i(z)b] - b'A-^(z)b + 2a'A-^(z)b_ 

1 + b'A~i(z)b — 2a'A“i(z)b — a'A^i(z)a[b'A“i(z)b] + (a'A^i(z)b)2 

_a'A-i(z)a[b'A-^(z)b] - 2a'A-^(z)b[a'A-i(z)b]_ 

1 + b'A~i(z)b — 2a'A~i(z)b — a'A~i(z)a[b'A~i(z)b] + (a'A~i(z)b)2 


/A-^(z)a[b'A-i(z)b] - b'A-^(z)b[l - a'A-i(z)a] + 2a'A-^(z)b[l - a'A-^(z)b] 
[a'A~i(z)b — 1]2 + b'A~i(z)b[l — a'A~^(z)a] 


(4.7) 


Now note that ^n{z) is a non-linear function of the quantities a'A“^(z)a, a'A~^(; 2 )a, b'A“^(z)b, 
b'A“^(z)b, a'A"^(z)b and a'A“^(^)b, which can easily be expressed in term of y and S^. For 
example, we have 

a'A-^(z)a = w77^^^y'(S+)2(S+-2;I)-^(S+)2y 


y'(S+)3y' 

1 

y'(S+)3y 

1 

y'(S+)^y 

1 

y'(S+)^y 

1 

y'(s+)^y 

1 

y'(s+)^y 


(y'S+(S+-zI + zI)(S+-;^I)-^(S+)2y) 

(y'(s+)^y + ^y'(s+ -zi + ^i)(s+ - zi)-^(s+)2y) 

(y'(S;t)^y + zy'(S+)^y + ^^y'(S+ - zl)~\S:^ - zl + ^I)S+y) 
(y'(s;t)^y + ^y'(s+)^y + ^^^y's+y + ^V(s+ - -zi + zi)y) 

(y'(S+)"y + ^y'(s+)^y + + ^Vy + /y'(S+ - ;^I)-'y), 
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where the last equality follows from the fact that y^S^y = 1. In a similar way, namely adding 
and subtracting zl from in a sequel we obtain representations for the remaining quantities 
of interest 


a'A ^(z)a = y/(-g+) 3 y (y'(Sn)V + 2z + 3zVy + 4zV(S+ - ^I) + zy(S+- zl) V) 

b'A-^(z)b = ^l^^^(l + zy'y + ^V(S+-zI)-iy) 
b'A"^(^)b = y^^^^4^(yy + 2zy'(S+-^I)-^y + ^V(S+-^I)"^y) 


(y'isyyr 

a'A-\z)h = ^77^:^(y'(S+)2y + z + zVy+ zV(S+-zI)-^y) 

yi^nj y 

a'A-^(z)h = —^;^(l + 2zyy + 3zy(S+-zI)~^y + zy(S;l-ziyy). 

y (^nj y 

In the next step we substitute these results in 04.71) . More precisely introducing the notations 

^ni^) = y'(S+ - M)~^y, an = l/y'(S+)^y, 

we have 0n(^) = ^On{z) = y'(S^ — 2 ;I)“^y and obtain for ^z) the following representation 


yz) = 


+ 


+ 


_ aljan^ + 2z + Sz'^y'y + Az^9n{z) + z'^9'n{z)){l + zy'y + z‘^9n{z)) _ 

ay {I + zy'y + z‘^en{z)Y - ayun^ + z + z^y'y + z^en{z)){l + zy'y + z'^Oniz)) 

_ «n^(y^y + 2z6>n(z) + z^6>^(z))(a~^ + z + z^y'y + z'^dniz)) _ 

ay{l + zy'y + z26'n(z))2 - alz{an^ + z + z^y'y + z36'„(z))(l + zy'y + z26'n(z)) 

_ 2q:^z( 1 + 2zy'y + 3z^6*n(z) + z^6*^(z))(l + zy'y + z^6>n(z)) _ 

ay {I + zy'y + z‘^en{z)f - ayan^ + z + z^y'y + z^en{z)){l + zy'y + z26l„(z)) 

«n ((1 + ^y'y + z^en{z)){a-^ - Z^(y'y + 2zen{z) + Z^e'njz)))) 

-an^(l + 2 :y'y + z'^6n{z)) 

a^z(y'y + 2z6'„(z) + z26'^(z))(a“^ + z(l + zy'y + z^0ri(z))) 


-anz(l + zy'y + z^0n(z)) 

In order to simplify the following calculations we introduce the quantities 
y(z) = 1 + zy'y + z^Oniz)] y(z) = y'y + 2z0n(z) + z^6'^(z), 


which lead to 

^ / N _ «nV’n(2:)(tt"^ - zyy)) + alzy{z){a-^ + zy{z)) 

-anzMz) 

ala-^iyz) + zjj'y)) ^ yjz) + z<(z) 

-anZi)n{z) zyz) 

1 <(z) 1 y'y + 2zg.(z) + zX(^) 

z y{z) z 1 + zy'y + z26'„(z) 

Finally, we derive the representation 03.12p 9n{z) using Woodbary matrix inversion lemma [see. 
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e.g., iHorn and .Tohnsohril (119851) ]: 


ffn(z) = y'(S* - zl) V 

= y'(l/iiY„(l/nY;Y„)-X-zI)^‘y 

= y' - 4l/nY„(Y;Y„)->(I - i(Y;Y„)-‘)-‘(Y;Y„)-'Y;) y 
= --y'y - 4y'l/"Y„(l/nY:,Y„)-'(I - l(l/nY:,Y„)-')-‘(l/>iY;.Y„)-'Y;,y 

Z Z 

= --y'y - 4-lUl/>iY;.Y„)(l/nY;,Y„)-'(I - l(l/nY;,Y„)-‘)-‘(l/nY;.Y„)-‘(l/nY:,Y„)l 

z z^ n z 

= --y'y - 4-i»(" - i(i/nY;,Y„)-')-‘i„ 

^ z^ n z 

= - Vy + 

^ zn 
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